
Extended URDF: Accounting for parallel mechanism in robot description

Batto, Virgile, de Matteis, Ludovic, Mansard, Nicolas

arXiv.org Artificial Intelligence

Robotic designs played an important role in recent advances by providing powerful robots with complex mechanics. Many recent systems rely on parallel actuation to provide lighter limbs and allow more complex motion. However, these emerging architectures fall outside the scope of the most commonly used description formats, leading to difficulties when designing, storing, and sharing the models of these systems. This paper introduces an extension to the widely used Unified Robot Description Format (URDF) to support closed-loop kinematic structures. Our approach relies on augmenting URDF with minimal additional information to allow more efficient modeling of complex robotic systems while maintaining compatibility with existing design and simulation frameworks. This method sets the basic requirement for a description format to handle parallel mechanisms efficiently. We demonstrate the applicability of our approach by providing an open-source collection of parallel robots, along with tools for generating and parsing this extended description format. The proposed extension simplifies robot modeling, reduces redundancy, and improves usability for advanced robotic applications.
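To make the idea concrete, here is a minimal sketch of what "augmenting URDF with minimal additional information" could look like: a standard serial kinematic tree plus one extra element that records the loop-closing constraint of a four-bar linkage. The `<loop>` tag and its attributes are illustrative assumptions for this sketch, not the paper's actual schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical extended-URDF snippet: a four-bar linkage whose closed loop
# is recorded by a "loop" element (tag and attribute names are invented
# here for illustration; see the paper for the real extension).
ROBOT_XML = """
<robot name="four_bar">
  <link name="base"/>
  <link name="crank"/>
  <link name="coupler"/>
  <link name="rocker"/>
  <joint name="j1" type="revolute">
    <parent link="base"/><child link="crank"/>
  </joint>
  <joint name="j2" type="revolute">
    <parent link="crank"/><child link="coupler"/>
  </joint>
  <joint name="j3" type="revolute">
    <parent link="coupler"/><child link="rocker"/>
  </joint>
  <loop name="closure" type="revolute">
    <link1 link="rocker"/><link2 link="base"/>
  </loop>
</robot>
"""

def closure_joints(xml_text):
    """Return (name, link1, link2) for each loop-closing element."""
    root = ET.fromstring(xml_text)
    return [(loop.get("name"),
             loop.find("link1").get("link"),
             loop.find("link2").get("link"))
            for loop in root.findall("loop")]

print(closure_joints(ROBOT_XML))  # [('closure', 'rocker', 'base')]
```

A parser unaware of the extra element still sees a valid serial tree, which is how such an extension can stay compatible with existing URDF tooling.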


NL2OR: Solve Complex Operations Research Problems Using Natural Language Inputs

Li, Junxuan, Wickman, Ryan, Bhatnagar, Sahil, Maity, Raj Kumar, Mukherjee, Arko

arXiv.org Artificial Intelligence

Operations research (OR) uses mathematical models to enhance decision-making, but developing these models requires expert knowledge and can be time-consuming. Automated mathematical programming (AMP) has emerged to simplify this process, but existing systems have limitations. This paper introduces a novel methodology that uses recent advances in Large Language Models (LLMs) to create and edit OR solutions from non-expert user queries expressed in natural language. This reduces both the need for domain expertise and the time to formulate a problem. The paper presents an end-to-end pipeline, named NL2OR, that generates solutions to OR problems from natural language input, and shares experimental results on several important OR problems.
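For a sense of what such a pipeline ultimately produces, here is a toy instance of the kind of optimization model an NL2OR-style system might formulate from a query like "pick projects to maximize profit within a budget of 10". The data, names, and brute-force solver are illustrative stand-ins, not the paper's method.

```python
from itertools import combinations

# Toy project-selection (knapsack-style) instance: name -> (cost, profit).
# A real pipeline would emit a formal model for an OR solver; we brute-force
# this tiny case for clarity.
projects = {"A": (4, 7), "B": (5, 8), "C": (3, 4), "D": (6, 9)}
BUDGET = 10

def best_selection(projects, budget):
    """Exhaustively search subsets; return (chosen names, total profit)."""
    best, best_profit = (), 0
    names = list(projects)
    for r in range(len(names) + 1):
        for combo in combinations(names, r):
            cost = sum(projects[n][0] for n in combo)
            profit = sum(projects[n][1] for n in combo)
            if cost <= budget and profit > best_profit:
                best, best_profit = combo, profit
    return best, best_profit

print(best_selection(projects, BUDGET))  # (('A', 'D'), 16)
```

The value of an NL2OR-style system is that the user supplies only the natural-language query; the formal model and solver invocation are generated automatically.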


CloudEval-YAML: A Practical Benchmark for Cloud Configuration Generation

Xu, Yifei, Chen, Yuning, Zhang, Xumiao, Lin, Xianshang, Hu, Pan, Ma, Yunfei, Lu, Songwu, Du, Wan, Mao, Zhuoqing, Zhai, Ennan, Cai, Dennis

arXiv.org Artificial Intelligence

Among the thriving ecosystem of cloud computing and the proliferation of Large Language Model (LLM)-based code generation tools, there is a lack of benchmarking for code generation in cloud-native applications. In response to this need, we present CloudEval-YAML, a practical benchmark for cloud configuration generation. CloudEval-YAML tackles the diversity challenge by focusing on YAML, the de facto standard of numerous cloud-native tools. We develop the CloudEval-YAML benchmark with practicality in mind: the dataset consists of hand-written problems with unit tests targeting practical scenarios. We further enhance the dataset to meet practical needs by rephrasing questions in a concise, abbreviated, and bilingual manner. The dataset consists of 1,011 problems that took more than 1,200 human hours to complete. To improve practicality during evaluation, we build a scalable evaluation platform for CloudEval-YAML that achieves a 20x speedup over a single machine. To the best of our knowledge, the CloudEval-YAML dataset is the first hand-written dataset targeting cloud-native applications. We present an in-depth evaluation of 12 LLMs, leading to a deeper understanding of the problems and LLMs, as well as effective methods to improve task performance and reduce cost.
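The core of unit-test-based scoring can be sketched in a few lines: each problem pairs a prompt with a checker that inspects the model's generated configuration, and the benchmark score is the fraction of checkers that pass. The checker, sample outputs, and scoring below are a simplified illustration, not CloudEval-YAML's actual harness (which runs real unit tests against live cloud tooling).

```python
# Each problem: (prompt, checker, model output parsed into a dict).
def check_replicas(config):
    """Pass iff the generated config is a Deployment with 3 replicas."""
    return (config.get("kind") == "Deployment"
            and config.get("spec", {}).get("replicas") == 3)

problems = [
    ("Create a Deployment with 3 replicas",
     check_replicas,
     {"kind": "Deployment", "spec": {"replicas": 3}}),  # correct output
    ("Create a Deployment with 3 replicas",
     check_replicas,
     {"kind": "Deployment", "spec": {"replicas": 1}}),  # failing output
]

def pass_rate(problems):
    passed = sum(1 for _, check, output in problems if check(output))
    return passed / len(problems)

print(pass_rate(problems))  # 0.5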


LLM and Infrastructure as a Code use case

Chanus, Thibault, Aubertin, Michael

arXiv.org Artificial Intelligence

Cloud computing and the evolution of management methodologies such as Lean Management or Agile entail a profound transformation in both system construction and maintenance approaches. These practices are encompassed within the term "DevOps." This descriptive approach to an information system or application, alongside the configuration of its constituent components, has necessitated the development of descriptive languages paired with specialized engines for automating systems administration tasks. Among these, the tandem of Ansible (engine) and YAML (descriptive language) stands out as the most prevalent pairing in the market, facing notable competition mainly from Terraform. The current document presents an inquiry into a solution for generating and managing Ansible YAML roles and playbooks, utilizing generative Large Language Models (LLMs) to translate human descriptions into code. Our efforts are focused on identifying plausible directions and outlining the potential industrial applications. Note: For the purpose of this experiment, we have opted against the use of Ansible Lightspeed. This is due to its reliance on an IBM Watson model, for which we have not found any publicly available references. Comprehensive information regarding this technology can be found [1] directly on the website of our partner, Red Hat.
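The generate-then-validate loop such an inquiry explores can be sketched as follows. The model call is stubbed with a canned response, and the structural gate is a naive stand-in for a real YAML parser or ansible-lint; none of this reflects the authors' actual implementation.

```python
def fake_llm(description):
    """Stub for a generative model turning a task description into a playbook."""
    return (
        "- name: {}\n"
        "  hosts: all\n"
        "  tasks:\n"
        "    - name: install nginx\n"
        "      ansible.builtin.package:\n"
        "        name: nginx\n"
        "        state: present\n"
    ).format(description)

def looks_like_playbook(text):
    # Crude structural gate: a play needs hosts and at least one task.
    # A production pipeline would parse the YAML and run ansible-lint instead.
    return "hosts:" in text and "tasks:" in text

def generate_playbook(description):
    draft = fake_llm(description)
    if not looks_like_playbook(draft):
        raise ValueError("generated text failed the structural check")
    return draft

print(generate_playbook("Install nginx"))
```

Gating generated code behind automated checks before a human reviews it is what makes LLM-produced infrastructure code plausible in an industrial setting.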


Build Reliable Machine Learning Pipelines with Continuous Integration

#artificialintelligence

As a data scientist, you are responsible for improving the model currently in production. After spending months fine-tuning the model, you discover one with greater accuracy than the original. Excited by your breakthrough, you create a pull request to merge your model into the main branch. Unfortunately, because of the numerous changes, your team takes over a week to evaluate and analyze them, which ultimately impedes project progress. Furthermore, after deploying the model, you identify unexpected behaviors resulting from code errors, causing the company to lose money.
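A CI pipeline addresses exactly this scenario by gating every pull request on an automated comparison against the production baseline. Below is a minimal sketch of such a gate; the baseline value, metric format, and threshold policy are illustrative assumptions, not a prescribed setup.

```python
import json
import sys

# Accuracy of the model currently in production (illustrative value; in a
# real pipeline this would be fetched from a model registry or metrics store).
BASELINE_ACCURACY = 0.91

def gate(metrics_json, baseline=BASELINE_ACCURACY):
    """Return True iff the candidate model meets or beats the baseline."""
    metrics = json.loads(metrics_json)
    return metrics["accuracy"] >= baseline

if __name__ == "__main__":
    # In CI this JSON would be read from the training run's artifacts.
    candidate = '{"accuracy": 0.93}'
    if not gate(candidate):
        sys.exit("candidate model underperforms the production baseline")
```

Run on every pull request, a check like this surfaces regressions in minutes instead of the week of manual review described above, and blocks faulty models before deployment.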


A step-by-step guide to using MLFlow Recipes to refactor messy notebooks

#artificialintelligence

Code repository for this post is here: you can see the MLFlow Recipes template in the main branch and the filled-in template on the fill-in-steps branch. The announcement of MLFlow 2.0 included a new framework called MLFlow Recipes. For a data scientist, using MLFlow Recipes means cloning a git repository, or "template", that comes with a ready-to-go folder structure for any regression or binary classification problem. This folder structure includes everything, from library requirements, configuration, notebooks and tests, that's needed to make a data science project reproducible and production-ready. It's easy to start a new project with MLFlow Recipes -- git clone a template from the MLFlow repository, and you are good to go.
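The kinds of pieces the post lists (requirements, configuration, notebooks, tests) can be sanity-checked programmatically after cloning. The sketch below does exactly that; the specific file and directory names are assumptions for illustration, not MLflow's actual template layout.

```python
from pathlib import Path
import tempfile

# Pieces a recipe-style template is expected to ship with (names are
# illustrative placeholders, not the real MLflow template contents).
EXPECTED = ["requirements.txt", "recipe.yaml", "notebooks", "tests"]

def missing_pieces(template_dir):
    """Return the expected entries absent from a cloned template directory."""
    root = Path(template_dir)
    return [name for name in EXPECTED if not (root / name).exists()]

# Toy stand-in for a freshly cloned template:
with tempfile.TemporaryDirectory() as d:
    for name in ["requirements.txt", "recipe.yaml"]:
        (Path(d) / name).touch()
    for name in ["notebooks", "tests"]:
        (Path(d) / name).mkdir()
    print(missing_pieces(d))  # []
```

A check like this can run in CI so that a refactored notebook project never silently loses the structure that made it reproducible.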


Ray Will Dominate

#artificialintelligence

My conviction for a product has never been so high for any ML Ops framework as for Ray. Let's be honest: most "ML Ops" libraries suck; worse than sucking, they will probably slow down your ML scientists and data scientists compared to not using anything at all. Occasionally (surprisingly quite often now), someone asks me about Ray and why I think it is going to win. I spent plenty of hours trying to formalize my thoughts and here is a summary of it. In layman's terms, through a set of beautifully designed libraries and easy-to-use decorators (@ray.remote),
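The programming model behind @ray.remote is: decorate a plain function, call `.remote(...)` to get a future immediately, and collect results later. Real Ray schedules those tasks across a cluster; the sketch below only mimics the shape of that API with a local thread pool, as a stdlib stand-in rather than Ray itself.

```python
from concurrent.futures import ThreadPoolExecutor

_pool = ThreadPoolExecutor(max_workers=4)

def remote(fn):
    """Toy imitation of ray.remote: .remote(...) submits fn and returns a future."""
    class Handle:
        def remote(self, *args, **kwargs):
            return _pool.submit(fn, *args, **kwargs)
    return Handle()

@remote
def square(x):
    return x * x

# Calls return immediately; results are gathered when needed,
# analogous to ray.get() on Ray object refs.
futures = [square.remote(i) for i in range(4)]
print([f.result() for f in futures])  # [0, 1, 4, 9]
```

The appeal is that the same decorator-based code scales from a laptop to a cluster without restructuring, which is hard to achieve with most other frameworks.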


Turn VS Code into a One-Stop Shop for ML Experiments

#artificialintelligence

One of the biggest threats to productivity in recent times is context switching. It is a term originating from computer science, but applied to humans it refers to the process of stopping work on one thing, performing a different task, and then picking the initial task back up. During a work day, you might want to check something on Stack Overflow, for example, which normalization technique to choose for your project. While doing so, you start exploring the documentation of scikit-learn to see which approaches are already implemented and how they compare against each other. This might lead you to some interesting comparison articles on Medium or video tutorials on YouTube.


Bea Stollnitz - Choosing the compute for Azure ML resources

#artificialintelligence

When training a machine learning model or deploying it to an endpoint, you'll need to choose an appropriate machine to run it. I'll use the term "compute" to refer to the virtual machine (or set of machines) that runs your code in the cloud. The goal of this blog post is to give you an overview of all the compute options available to you in Azure ML, so that you can choose an appropriate option for your scenario. I'll assume that you're already familiar with the basic concepts of Azure ML, and that you have some experience using it for your own projects. Throughout this post, I'll discuss the three major compute types available in Azure ML, and I'll also briefly mention the available VM sizes, including how to get more quota for a particular VM size.
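The selection problem the post walks through (match a job's resource needs to a VM size without overpaying) can be sketched as a simple lookup. The size names below mirror Azure's naming style, but the specs and prices are made-up placeholders, not real Azure rates or quota figures.

```python
# Illustrative catalog: VM size -> specs (placeholder numbers, not Azure's).
SIZES = {
    "Standard_DS3_v2": {"vcpus": 4, "memory_gb": 14, "hourly_usd": 0.23},
    "Standard_DS4_v2": {"vcpus": 8, "memory_gb": 28, "hourly_usd": 0.46},
    "Standard_NC6":    {"vcpus": 6, "memory_gb": 56, "hourly_usd": 0.90},
}

def cheapest_size(min_vcpus, min_memory_gb):
    """Pick the lowest-cost size meeting the job's CPU and memory needs."""
    fits = [(spec["hourly_usd"], name)
            for name, spec in SIZES.items()
            if spec["vcpus"] >= min_vcpus and spec["memory_gb"] >= min_memory_gb]
    return min(fits)[1] if fits else None

print(cheapest_size(4, 14))  # Standard_DS3_v2
print(cheapest_size(6, 32))  # Standard_NC6
```

In practice the same trade-off applies whether you pick sizes by hand in the Azure ML studio or script the choice; the key is sizing to the job rather than defaulting to the largest machine.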


Sparse Transformers

#artificialintelligence

Originally published on Towards AI, the world's leading AI and technology news and media company. If you want to analyze how fast 19 sparse BERT models perform inference, you'll only need a YAML file and 16GB of RAM to find out.
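Under the hood, an inference-speed comparison like this boils down to a timing harness run per model: warm up, then average wall-clock latency over repeated calls. The sketch below shows that harness with a dummy workload standing in for an actual sparse-BERT forward pass; it is a generic illustration, not the post's benchmarking tool.

```python
import time

def bench(fn, *args, warmup=3, iters=20):
    """Average seconds per call of fn(*args), after warmup runs."""
    for _ in range(warmup):
        fn(*args)  # warm caches / JITs before measuring
    start = time.perf_counter()
    for _ in range(iters):
        fn(*args)
    return (time.perf_counter() - start) / iters

def dummy_inference(n=10_000):
    # Placeholder workload; a real benchmark would call the model here.
    return sum(i * i for i in range(n))

print(f"{bench(dummy_inference) * 1e3:.3f} ms/iter")
```

With the harness fixed, comparing 19 models is just a loop over model configurations, which is why a single YAML file describing them is enough to drive the whole experiment.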